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Abstract: Precision agriculture considers reference evapotranspiration (ETo) as a 
significant meteorological parameter for estimating crop water need and thus planning 
precision irrigation. This paper proposes a stacked bi-directional long short-term 
memory (BLSTM) framework for the hourly forecasting of ETo. The recursive feature 
elimination (RFE) process was employed to select the optimal feature lag for the model 
construction. Single and multi-step ahead ETo forecasting models with stacked BLSTM 
and LSTM architectures were developed using RFE suggested 24 feature lags as input 
and the prediction horizons of 1, 6, 12, and 24. The evaluation process verified the 
efficiency of stacked BLSTM models over stacked LSTM models in all prediction 
horizons. The study reported the best accuracy for the single-step ahead ETo 
forecasting model, BLSTM 24-1, with a mean absolute error (MAE) of 0.94 mm/hour, a 
root mean squared error (RMSE) of 1.7 mm/hour and a coefficient of determination 
(R2) of 0.91. Though the shorter horizon improves the forecasting accuracy, the 
negligible variations in the performance metrics observed in the multi-step ahead 
models, such as 1.31-1.11 mm/hour in MAE, 2.38-2.11 mm/hour in RMSE, and 0.83- 
0.86 in R2, emphasized the supremacy of stacked BLSTM architecture in ETo hourly 
forecasting, irrespective of the prediction horizon. Hence, in addition to the most 
reliable one-step ahead BLSTM model, this study also recommends the 24 hours ahead 
BLSTM model as a daily irrigation planning tool for agriculturists by aggregating its 
output time series. 


INTRODUCTION 


need for a large number of meteorological variables for its 
estimation, as well as their unavailability in many areas, 


Peau the world’s most water-intensive activity [1], 


is currently looking for some innovative precision irrigation 
techniques that promote wise water usage and assist 
farmers in achieving higher levels of crop yield in a lesser 
amount of water. The precise estimate of the amount of 
water needed for a crop to grow serves as the basis of 
precision irrigation techniques [2], and it is closely related 
to the term evapotranspiration (ET) in the hydrologic cycle 
[3]. The combined processes of surface evaporation and 
plant transpiration define the evapotranspiration [2], anda 
variant of it, the reference evapotranspiration (ETo,) is a 
meteorological parameter for calculating the actual ET of 
different crops [4]. 

ETo can be directly measured with a lysimeter, but 
its complexity necessitates the development of various 
empirical equations to estimate ETo [5]. Among these, the 
Penman-Monteith equation (FPME) developed by the Food 
and Agriculture Organization of the United Nations is 
considered as the standard ETo estimation method [6]. The 


paved the way for the use of artificial intelligence (AI) 
techniques in ETo modelling [7]. Several minimal input 
parameter-based ETo modelling studies have taken 
advantage of the ability of AI algorithm to portray the 
nonlinear relation between input parameters and ETo [8]- 
[10]. Moreover, the short-term or long-term forecasting of 
ETo will help agriculturists to plan irrigation and manage 
their water supplies more effectively [11]. However, 
majority of ETo modelling research has focused on the 
estimation of ETo using multiple input variables such as 
temperature, humidity, wind speed and solar radiation [5]. 
Univariate analysis and time series forecasting studies 
based on ETo, resulting in multiple steps ahead ETo time 
series values, are limited in the literature. 


Related Work 

The deep learning-based time series data analysis of ETo is 
emerging in the ET domain because of its competence to 
learn the ETo data pattern. ETo forecasting studies using 
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time-lagged recurrent neural networks (TLRNN) [12], 
encoder-decoder long short-term memory (LSTM) [13], 
bidirectional LSTM (BLSTM) [14]- [16], hybrid and 
ensemble deep learning architectures [11], [17] 
demonstrated this. Proias et al. [12] made a one to three 
days ahead ETo forecast using TLRNN. According to the 
multi-step ahead daily ETo forecasting study report of 
Ferreira and Cunha [11], the Convolutional LSTM hybrid 
model performed well in local and regional scenarios. In 
another study [18], they tried to estimate daily ETo from 
various input parameters measured hourly and reported a 
fair result. Using a multivariate hybrid BLSTM model, Yin et 
al. [15] successfully forecasted a short-term, 1-7 day ahead 
ETo time series. Roy [16] investigated the potential of 
BLSTM in the one-step ahead forecasting of daily ETo time 
series. However, apart from these few studies, the analysis 
of deep learning architectures in ETo modelling, especially 
in time series forecasting, is very limited. 

Previous ETo time series forecasting studies have 
produced results ranging from shorter forecast periods of 
one to seven days [19], [20] to longer forecast horizons of 
several months, spanning one to several steps forward [15], 
[21]- [25]. Most of them have used neural networks in their 
modelling. Manikumari et al. [26] employed boosted and 
bagged neural networks in the daily ETo time series 
forecasting up to 365 days and proved its success in 
comparison with artificial neural network (ANN) models. 
Nourani et al. [21] observed an accurate single and multi- 
step ahead ETo forecast using monthly time series inputs 
with ensemble models. Gocic and Amiri [22] investigated the 
effect of using the optimal input time lags for the prediction 
of monthly ETo using ANN. Certain studies have conducted 
multi-step ahead ETo forecasting with prediction horizons 
such as 1-30 days [27] and 16 days [28]. To the best of 
author's knowledge, no studies on hourly ETo time series 
forecasting to generate one-step or multi-step ahead ETo 
forecasts have been identified in the literature to date. 

The ability of the LSTM and its derivatives to learn 
long-term input-output dependence [29] propelled them to 
the top among forecasting research. Additionally, the BLSTM 
architecture can retain both past and future time series in 
forecasting modelling, resulting in more accurate results 
[15]. Hence, we opted to experiment with stacked BLSTM 
architectures for the proposed investigation of hourly ETo 
forecasting. 

An optimal input feature or lag of a time series 
forecasting model is a critical factor for determining its 
accuracy [22]. The recursive feature elimination (RFE) is a 
feature selection method that seeks an optimal time lag by 
recursively eliminating least important features of the 
model [30]. Some forecasting studies implemented it [30], 
[31], but ETo forecasting studies never attempted RFE for 
optimal feature selection. Summarizing the aforementioned 
points, the objectives of the proposed study are 

¢ to design and implement stacked BLSTM 
architecture for the single and multi-step ahead 
time series forecasting of ETo on an hourly basis, 

e to select the optimum feature lag for the proposed 
model by using RFE feature selection methodology, 
and 

¢ to evaluate the efficiency of the proposed method in 
contrast to the stacked LSTM architecture. 


METHODS AND MODELS 


Data set and data preparation 

The present study used the hourly ETo time series data 
estimated from the meteorological data such as 
temperature, humidity, net solar radiation, and wind speed. 
The hourly meteorological data from November 2015 to 
February 2021 was collected from the Advanced Centre for 
Atmospheric Radar Research (ACARR), Cochin University of 
Science and Technology in Kochi, Kerala, India. The ASCE- 
Penman Monteith equation (APME) was applied to 46324 
data records to estimate hourly ETo. APME is a variant of 
FPME for estimating ETo on an hourly basis [18] and is given 


by (1). 


0.408 A(R, —G) + v( 


37 
Tr 1.972 u2 (e, ~~ €q) 
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A+y(1 + Cyuz) (1) 

where ETo is the reference evapotranspiration (mm/hour), 
A is the slope of vapour pressure curve (kPa/0C), y is the 
psychometric constant (k PaO C-1), R_n is the net solar 
radiation (MJm-2hour-1), ), e_s is mean saturation vapour 
pressure(kPa), e_a is actual vapour pressure (kPa), G is the 
soil heat flux (MJm-2hour-1 ), T_m is the daily maximum 
temperature (0C), u_2 is the wind speed at 2m height (m/s), 
C_his a constant with a value 0.24 during day time and 0.96 
during night time. 
To ensure faster learning and convergence, min-max 
normalization was applied to the input time series data for 
model training. Missing time series values were imputed 
using interpolation technique. Statistical properties of the 
processed data are listed in Table 1. 


Table 1: Statistical description of the ETo time series data 


Standard 


Count Max Min Mean ae Unit 
deviation 
46324 244 -2.0 2.4 49 Millimeter/ 
hour 


RFE feature selection 

A well-designed feature selection process often improves 
the precision of a machine learning forecasting model. In the 
univariate forecasting approach, the observations in the 
previous time steps or lag values form the input features of 
the model. Because the lag determines the length of 
historical time series records, an optimal selection of it often 
results in a suitable input vector to the forecasting model, 
reducing computation time and increasing forecasting 
precision [22]. 

RFE is a wrapper methodology that trains the 
forecasting model with the original feature set of a certain 
time steps and ranks the lag features by significance [30]. 
The pruning process is then carried out on the lags with the 
lowest score. This process is repeated with a progressively 
smaller subset, which is reduced by the desired number of 
features [31]. 

This study used RFE to perform experiments for 
optimum lag selection, by setting the original feature set 
with 1-to-24-time steps and the desired number of input 
features to 1. The best lag values obtained is 24, as seen in 
Fig 1. Fig 1(a) depicts the correlation of each lag observation, 
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while Fig 1(b) depicts the feature selection rank (smaller is 
better) for each input feature. 


Proposed Stacked BLSTM framework 

BLSTM is a deep learning architecture consisting of two 
separate recurrent neural networks in charge of preserving 
past and future time series information. Unlike 
unidirectional LSTM, which only considers the previous 
history of observations, BLSTM processes time series 
observations in both forward and backward directions [14]. 
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The forward and backward layers do not allow information 
to flow in between, but both direct the information to the 
output layer. The individual forecast of each BLSTM unit is 
aggregated to determine the final forecast. Because BLSTM 
is an extended version of LSTM, it inherits its ability to 
capture the long-term interactions among the time series 
data by avoiding the vanishing or exploding gradient issues 
[15]. The structure of BLSTM architecture is depicted in Fig 
2. 


30 
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Fig 1: Optimal feature lag selection using (a) correlogram and (b) RFE from the ETo time series dataset 


Fig 2: Structure of BLSTM model 
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The BLSTM calculates the final output y, by aggregating the 
forward layer output, ht and backward layer output, he. 
X14, Xp, and x,,, are the input time lag values, and y,_,, y;, 
and y;,, are the output time lag values. The information flow 
between the layers can be mathematically represented as 
(2), (3), and (4). 


hy = f(wixe ot Wohy-1) (2) 
hy = f (wax + Welte+1) (3) 
Vt = g (wah; an Weh) (4) 


where w_1 and w_3 are input to hidden layer (both forward 
and backward) weights, w_2 and w_5 are the weights 
between hidden layers, and w_4 and w_6 are hidden layer to 
output layer weights. 

In this study, the characteristics of a BLSTM unit has 
been exploited to achieve an improved ETo forecast by 
stacking multiple units of it. The proposed stacked BLSTM 
framework for the hourly ETo forecasting for both single 
and multiple step ahead contains two stacked BLSTM layers, 
each one with 50 units along with a dropout layer with 0.2 
dropout rate. During the model's training phase, the ReLu 
activation and Adam optimization algorithms were used. 
The best results were obtained for a batch size of 256, a 
learning rate of 0.001, and an epoch size of 200. All of these 
parameters were chosen using a trial-and-error strategy, 
and the results of such experiments are discussed in section 
IV. 


ETo forecasting strategies for the proposed model 
implementation 

The proposed research considers two strategies for ETo 
forecasting: one step ahead and multi-step ahead, both 
experimenting with optimal input time lag of 24 according 
to the findings described in section II. The prediction 
horizon for multi-step ahead ETo forecasting is randomly 
chosen as 6 hours, 12 hours, and 24 hours, while the 
prediction horizon for the single step forward is one. In 
addition to the proposed stacked BLSTM model, a 
unidirectional stacked LSTM model has also been designed 
and tested for performance evaluation. The entire time 
series data was split in a 70:30 ratio for model training and 
testing. 


Both the stacked LSTM and BLSTM architectures were 
tested with one to three stacked layers with a search space 
of 10, 20, 30, 50, 70, and 100 units. To prevent overfitting 
problems, the dropout layers were used in the architecture 
with an experimented dropout rate of 0.1, 0.2, and 0.3. The 
batch sizes tested were 128, 256, and 516, and an overall 
epoch of 300 with early stopping was also used. The built 
models are labelled according to the architecture used, the 
number of input time lags, and the prediction horizon length. 
The values of hyper parameters and layer configuration of 
the proposed and evaluation models are listed in Table 2. 


Performance evaluation criteria 


The proposed stacked BLSTM models for both one and 
multi-step forward ETo forecasting were compared to the 
stacked LSTM models using the error indicators such as 
mean absolute error (MAE), root mean square error (RMSE), 
and the coefficient of determination (R?) [11]. The 
proficiency of the constructed models is characterized by a 
lower value of MAE and RMSE and a higher value of R2. These 
statistical performance indicators are represented in (5), 
(6), and (7). 


MAE = -Y|ET, — ET,| (5) 

RMSE = A x(ET, SER (6) 
2 

R2- Y(ETp-ETp)(ETq-ETa) (7) 


| X(ET -ET)” N(ETa-ETa)? 


where n is the number of data records, ET, is the predicted 
ETo values, ET, is the actual ETo values, EL, is the mean of 
predicted ETo values, ET, is the mean of actual ETo values. 


RESULTS AND DISCUSSIONS 


This section analyses the performance of the proposed 
stacked BLSTM ETo forecasting model architecture to the 
stacked LSTM architecture in both single step ahead and 
multi-step ahead forecasting scenarios. For the design of the 


Table 2: Layer configuration and hyper parameter settings of the built models 


Model Horizon 
ecchibecture length Model name Hyper parameter Value 
Stacked layers 2 
1 BLSTM 24-1 Units 50 
6 BLSTM 24-6 
Stacked BLSTM 12 BLSTM 24-12 Propout oe 
24 BLSTM 24-24 aueSe 
Epoch 300 
1 LSTM 24-1 Stacked layers 2 
Units (100, 50) 
Stacked LSTM . pee D t 0.2 
ea 12 LSTM 24-12 ee = 
24 LSTM 24-24 eae 
Epoch 300 
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Table 3: Performance metrics of ETo forecasting models during the testing period 


Forecast . BLSTM LSTM 
Horizon 
strategy MAE RMSE R2 MAE RMSE R2 
One step 0.94 1.7 0.91 0.99 1.8 0.89 
1.11 2.11 0.86 1.33 2.26 0.84 
Multi step 12 1.3 2.31 0.84 1.39 2.44 0.81 
24 1.31 2.38 0.83 1.4 2.5 0.8 


models, the ETo time series data was pre-processed with 
time steps of 24 observations looking backward and 1-, 6-, 
12-, and 24-time steps looking forward. The assessment 
metrics derived from evaluating the models with test data 
are listed in Table 3 and illustrated graphically in Fig 3. 


Performance evaluation of the proposed model in single 
step ahead forecasting 

All the performance indicators in Table 3 denote the 
superiority of BLSTM 24-1 over LSTM 24-1. The percentage 
decrease of 5 in MAE, 5.6 in RMSE, and a percentage increase 
of 2.3 in R2 of stacked BSLTM compared to LSTM 24-1, 
demonstrate its effectiveness in hourly and single step 
ahead ETo forecasting. 


Performance evaluation of the proposed model in multi- 
step ahead forecasting 

All the multi-step ahead forecasting models based on the 
stacked BLSTM architecture (BLSTM 24-6, BLSTM 24-12, 
and BLSTM 24-24) outperformed the stacked LSTM 
forecasting models (LSTM 24-6, LSTM 24-12, and LSTM 24- 
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24). In comparison to LSTM 24-6, BLSTM 24-6 showed a 
percentage decrease of 16.5 in MAE, 6.6 in RMSE, and a 
percentage rise of 2.4 in R2. Both BLSTM 24-12 and BLSTM 
24-24 exhibited a similar percentage of decrease of 6.4 in 
MAE, and 3.7 percentage of increase in R2, but a different 
percentage decrease of 5.3 and 4.8 in RMSE compared to 
LSTM 24-12, and LSTM 24-24. These findings demonstrate 
the efficacy of stacked BLSTM architecture in multi-step 
ahead hourly ETo forecasting. 


Result analysis 

Regardless of the forecasting strategies, the stacked BLSTM 
architectures proved its ability in the hourly forecasting of 
ETo in this research. Fig 4, which shows the comparison of a 
sample forecasted and actual ETo time series, substantiates 
this. For all prediction horizons, BLSTM forecasted ETo 
values have a close correlation with actual ETo values in 
comparison with the LSTM forecasted ETo. The plot also 
highlights that the accuracy of the prediction decreases with 
the increase in the forecasting horizon length. 


Forecast horizon 


(a) MAE 


{ mmm BLSTM 
mm LSTM 


RMSE (mm/hour) 


6 


Forecast horizon 


(b) RMSE 


Sowmya et al, A stacked BLSTM framework for the hourly forecasting of ETo. J Biol Engg Res & Rev, Vol. 8, Issue 2, 2021 


87 


1 6 


mmm BLSTM 
mam LSTM 


Forecast horizon 
(c) R? 
Fig 3: Comparison of the performance evaluation metrics for stacked BLSTM and LSTM models for single and multi-step 


ahead forecasting 
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Fig 4: Comparison of predicted and actual hourly ETo time-series values using stacked BLSTM and LSTM models for single 
and multi-step ahead forecasting 


From the Table 3, the lowest values of MAE (0.94 mm/hour) 
and RMSE (1.7 mm/hour) as well as the highest value of R2 
(0.91) were observed with the horizon length of 1, and it 
gradually changed to 1.31 mm/hour of MAE, 2.38 mm/hour 
of RMSE and 0.83 of R2 corresponding to the metric values 
of BLSTM 24-24. Although there exists a_ significant 
difference (15-28 % decrease in MAE, 19-28 % decrease in 
RMSE, 6-10 % increase in R2) between the single and multi- 
step forward forecasting performance, there is only a 
marginal performance variation among the multi-step 
ahead models. 

Previously published ETo forecasting studies 
employed FPME recommended daily ETo values as input 
series to generate single and multi-step ahead daily and 
monthly ETo time series. The deep learning research 
conducted by Ferreira and Cunha [18] experimented with 
various hourly measured input data combinations to 
estimate daily ETo both directly and indirectly (by adding 


the predicted hourly ETo for 24 hours). It proved the efficacy 
of hourly input data in the accurate estimation of daily ETo. 
Hence, in addition to recommending a stacked BLSTM- 
based, single-step-ahead ETo forecast model, the present 
study reached another conclusion by relating the findings of 
the research of Ferreira and Cunha [18]. That is, BLSTM 24- 
24 is the most significant of the developed multi-step-ahead 
hourly ETo forecasting models because it can forecast the 
next 24 hours of ETo values, leading to the daily ETo forecast 
by summing its output forecast. 


CONCLUSION 


ETo hourly forecasting is a novel field of research in the ET 
domain. This paper investigates the capability of the stacked 
BLSTM deep learning framework in the hourly forecasting 
of ETo, incorporating single and multi-step ahead 
forecasting horizons. An optimal lag of ETo time series for 
the reliable forecasting was identified by applying the RFE 
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feature selection method. Using an optimal lag of 24 
observations from the ACARR time series data set, four 
models with different forecast horizons of 1, 6, 12, and 24 
were designed, each using both stacked BLSTM and LSTM. 
When the efficiency of the designed architectures was 
evaluated using various statistical assessment criteria, 
stacked BLSTM surpassed stacked LSTM in both single and 
multi-step ahead ETo forecasting. Of the developed models, 
the single step ahead ETo forecasting model showed the best 
forecasting accuracy. In addition to the observation that a 
shorter forecasting horizon improves forecasting precision, 
multi-step ahead models have provided comparable 
accuracy to single-step models, with no substantial 
difference in forecasting efficiency between them. 
Furthermore, by aggregating its output forecast of 24 ETo 
values, the BLSTM 24-24 model can be used as a daily ETo 
calculator, assisting agriculturists in precisely planning 
irrigation. 
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